A front-page news-selection algorithm based on topic modelling using raw text
نویسندگان
چکیده
منابع مشابه
A front-page news-selection algorithm based on topic modelling using raw text
Front-page news selection is the task of finding important news articles in news aggregators. In this study, we examine news selection for public front pages using raw text, without any meta-attributes such as click counts. A novel algorithm is introduced by jointly considering the importance and diversity of selected news articles and the length of front pages. We estimate the importance of ne...
متن کاملFront page news
Science reporters always have a lot to think about between the time they receive press materials from a journal and the time they have to decide whether to play up an advance or ignore it altogether. Journals, universities and companies do their best to be good salespeople — to market their product for mass consumption. Science reporters, too, are looking for juicy stories. But they don’t want ...
متن کاملA Text Mining Research Based on Lda Topic Modelling
A Large number of digital text information is generated every day. Effectively searching, managing and exploring the text data has become a main task. In this paper, we first represent an introduction to text mining and a probabilistic topic model Latent Dirichlet allocation. Then two experiments are proposed Wikipedia articles and users’ tweets topic modelling. The former one builds up a docum...
متن کاملNews Selection with Topic Modeling
There are numerous news articles coming to news aggregators and important news are selected to be presented on the front-page. There are two types of news selection for the front-page of news aggregators: personalized and public news recommendation (selection). This study examines public news recommendation that aims to satisfy all users’ interest on the front-page. Public news recommendation i...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Information Science
سال: 2015
ISSN: 0165-5515,1741-6485
DOI: 10.1177/0165551515589069